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Question 6 


Intent of Question 





The primary goals of this question were to assess students’ ability to (1) implement simple random 
sampling; (2) calculate an estimated standard deviation for a sample mean; (3) use properties of variances 
to determine the estimated standard deviation for an estimator; (4) explain why stratification reduces a 
standard error in a particular study. 


Solution 
Part (a): 


Peter can number the students from 1 to 2,000 and then use a calculator or computer to generate 100 
unique random numbers between 1 and 2,000 without replacement. If non-unique numbers are 
generated, the repeated numbers are ignored until 100 unique numbers are obtained. The students 
whose numbers correspond to the randomly generated numbers are then selected for the sample. 


Part (b): 


The estimated standard deviation of the sampling distribution of the sample mean is: 
s 4.13 


==, or 
4100 


= 0.413. 
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Part (c): 


2 
The variance of Rania’s estimator is (0.6)? Var(X,) + (0.4)? Var (2) where Var (X,) = ct represents 
f 


qQ 


2 
m 


the variance of the point estimator for females and Var (Xx a =n. represents the variance of the 
m1 


point estimator for males. 


The estimated standard deviation is the square root of the variance. Using the respective sample 
standard deviations s; and s,, for the population parameters, Rania’s estimate is calculated as: 





2; 
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= V0.01944 + 0.01972 = 0.198. 


Part (d): 


The comparative dotplots from Rania’s data reveal that the distribution of the number of soft drinks for 
females appears to be quite different from that of males. In particular, the centers of the distributions 
appear to be significantly different. Additionally, the variability of values around the center within 
gender in each of Rania’s dotplots appears to be considerably less than the variability displayed in the 
dotplot of Peter’s data. Rania’s estimator takes advantage of the decreased variability within gender 
because her data were obtained by sampling the two genders separately. Peter’s estimator has more 
variability because his data were obtained from a simple random sample of all the high school 
students. 
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Question 6 (continued) 


Scoring 
Parts (a), (b), (c), and (d) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
Part (a) is scored as follows: 


Essentially correct (E) if the response provides a correct sampling procedure to obtain a simple random 
sample that includes sufficient information such that two knowledgeable statistics users would 
implement equivalent methods. 


Partially correct (P) if the response provides a plausible sampling procedure for obtaining a simple 
random sample but does NOT include sufficient information as described for E. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes 

e If computer-generated random numbers are used, an explicit statement about ignoring repeats is 
needed for sufficient information. (Sampling until 100 students are obtained conveys the idea of 
ignoring repeats.) 

e If atable of random numbers is used without adequate specification of the numbers to be used (for 
example, 0001 to 2,000 or a different set of 2,000 four-digit numbers), the response is scored as 
incorrect (I). 

e If atable of random numbers is used with adequate specification of the numbers to be used, as 
described above, a statement about ignoring repeats AND a statement about ignoring numbers not 
in the specified range are both needed to earn an E. If either or both are missing, the response is 
scored as partially correct (P). (The statement about ignoring repeats or the statement about 
ignoring numbers not in the range of specification may be implicit. For example, “... continue this 
procedure until 100 students are selected.”) 

e The procedure of using names or numbers on slips of paper in a hat must indicate some 
randomization in selection (for example, mixing the slips of papers or randomly choosing from the 
hat). 

e Ifthe sampling procedure does not produce a simple random sample, then the response is scored 
as incorrect (I). 


Part (b) is scored as follows: 
Essentially correct (E) if the correct calculation is performed AND sufficient work is shown. 
Partially correct (P) if the correct answer is provided but no work is shown. 
Incorrect (I) if the response does not meet the criteria for E or P. 
Notes 


e The correct formula (either in symbols or with correct numerical substitutions) is sufficient for 
shown work in either part (b) or part (c). 


e Aresponse with only the answer 0.413 with no work shown is scored as P (because dividing by the 
square root of 100 can be done mentally). 
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Question 6 (continued) 


Sufficient shown work with no calculation is scored as partially correct (P). For example, showing 
413 _. 
—— with no calculation. 


¥100 
Minor calculation errors can be ignored in part (b) or part (c) and considered when determining 
between two scores. 


The incorrect use of the notation o instead of s is considered a minor error in the investigative 
task and will not reduce the score in either part (b) or part (c). 


Part (c) is scored as follows: 


Essentially correct (E) if the response includes the following three components: 


1. Combining variances. 
2. Using weights (0.6 for females and 0.4 for males). 


3. Recognizing variability of sample means (variance divided by sample size), 
AND 


the response correctly combines these three components to produce the estimated standard deviation 
for Rania’s point estimator. 


Partially correct (P) if the response has at least two of the three components AND a reasonable attempt 
to combine these components to produce the (overall) estimated standard deviation for Rania’s point 
estimator. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes 


When part (c) is scored partially correct (P), the reasonableness of the numerical result can be 
considered in determining the overall score. 

The following are algebraically equal and provide the correct estimated standard deviation of 
Rania’s point estimator of 0.198: 








2 2 2 2 2 2 a 2 
22% 2 Sm =| 2 SF 25m = SF oo = [0st +104en 
(os 7 + (0.4) oa (0.6) 50 + (0.4) 7 (0.6) Too + (0.4) 100 = [00 


A response that combines si and sf to obtain a pooled standard deviation for Rania’s data, such 
as 











ga [te = D8? + 1m = DSi _ (coi + (39)(2.22)" _ 4 gage 
P (n, — 1) + (a, - 1) 59 + 39 ’ 

contains two of the three components: combining variances and using weights 
59 39 


( = 0.6 and 98 * 0.4) for part (c). Because pooling is a reasonable attempt to combine the 


components, the calculation of s, ~ 1.97786 is scored as partially correct (P). If the response 


Sp .. LO7786 
vn V100 
partially correct, because the pooling is a reasonable, but not the correct, combination of the three 
components to compute the sample standard deviation of the stratified sample mean. 


includes the correct third component, [ = 0197786) the response is still scored 
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Question 6 (continued) 
Part (d) is scored as follows: 


Essentially correct (E) if a reasonable justification is provided based on BOTH of the following two 
components: 
1. Smaller variability in responses for each gender in comparison with the variability in responses 
in Peter’s data, and 
2. Linkage (either explicit or implicit) between the smaller variability in responses for each gender 
and producing a smaller estimated standard deviation for Rania’s combined point estimator 
than Peter’s point estimator. 


Partially correct (P) if the benefit of the smaller variability in responses for each gender with 
stratification (homogeneity within each gender) for these samples is identified, but there is no 
reference to how it affects the estimated standard error for Rania’s point estimator, 

OR 
if the benefit of different centers with stratification (heterogeneity between genders) for the samples is 
identified, without reference to the resulting smaller variability in responses for each gender and how it 
affects the estimated standard error for Rania’s point estimator, 

OR 
if the response identifies that Rania’s combined data for males and females has smaller variability than 
Peter’s data. 





Incorrect (I) if the response does not meet the criteria for E or P. 


Notes 

e Comments about shapes of the distributions are extraneous and can be ignored. 

e General statements about the benefits of stratification without using the sample data in the 
dotplots are not sufficient. 


Each essentially correct (E) part counts as 1 point. Each partially correct (P) part counts as % point. 


4 Complete Response 

3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 22 points), use a holistic approach to decide whether to 
score up or down, depending on the overall strength of the response and communication, especially in the 
investigative parts—part (c) and part (d)—of the response. 
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Question 6 Z 
Spend about 25 minutes on this part of the exam- . 
Percent of Section II score—25 - 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. Two students at a large high school, Peter and Rania, wanted to estimate y, the mean number of soft drinks that 
a student at their school consumes in a week. A complete roster of the names and genders for the 2,000 students 
at their school was available. Peter selected a simple random sample of 100 students. Rania, knowing that 
60 percent of the students at the school are female, selected a simple random sample of 60 females and an 
independent simple random sample of 40 males. Both asked all of the students in their samples how many soft 
drinks they typically consume in a week. 








(a) Describe a method Peter could have used to select a simple random sample of 100 students from the school. 
For peter to vtlect a mre yandom sample of (00 students From 
the school he could get a It of all 2000 students tet attend 
the schoo! umber the students tom ooo to 777. We o 
random intesers table, starting fom fhe fist We, move 
acyass the ‘teble tour aigifs of co HNC. Record the numbeR 
skier ond mumber over ity ond (hore roreqts. once = me 
fist |od numbers ore selected the students Covre sponding 
fo taser vumbeks will be the (00 stulerty of He Scmpe. 


any part of this page Is illegal. GO ON TO THE NEXT PAGE. 
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-Peter and Rania conducted their studies as described. Peter used the sample mean X asa point estimator for yu. . 
‘Rania used Rodi = (0.6) X female + (0.4)X male as a point estimator for “, where Xtemale is the mean of the 


sample of .60 females and X male is the mean of the sample of 40 males. 


Summary statistics for Peter’s data are shown in the table below.” 


- 


: Deviation 
Number of 
5.32 4.13 
Pe ee ae 
(b) Based on the summary statistics, calculate the estimated standard deviation of the sampling distribution 


(sometimes called the standard error) of Peter’s point estimator X. 


io. 





Summary statistics for Rania’s data are shown in the table below. 


Deviation 
Numberof [Female | 60 | 290 | 180 (| 
soft drinks 











(c) Based on the summary statistics, calculate the estimated standard deviation of the sampling distribution of 
Rania’s point estimator Xoveral] = (0.6)Xfemale + (0.4)X male. 


[00° (y+ co (BY = 


any part of this page Is lilegal. ; : GO ON TO THE NEXT PAGE. 
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A dotplot of Peter’s sample data is given below. 6 
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Comparative dotplots of Rania’s sample data are given below. 
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(d) Using the dotplots above, explain why Rania’s point estimator has a smaller estimated standard deviation 
than the estimated standard deviation of Peter’s point estimator. 


Ranias .pqint estimatoy has a Smaller estimated gtandard 
deviation than the estimated standard deviation of 
peters port estimator becawe Rania used two independém — 
random amples. cach vandum sample had different Stand ala 
deviotton ond sample size, poth which are smaller than 
peters dampPle dee and standard deviation. PCAWE gt the 
rule for vorricinres Netléd +o calculofe Ran ia's overall samfle 
standard dewation the small valyts would resuft in 
smal estingted standard deviation, 
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STATISTICS 6 B 
SECTION I 
Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II score—25 


‘81 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. Two students at a large high school, Peter and Rania, wanted to estimate , the mean number of soft drinks that 


a student at their school consumes in a week. A complete roster of the names and genders for the 2,000 students 
at their school was available. Peter selected a simple random sample of 100 students. Rania, knowing that 

60 percent of the students at the school are female, selected a simple random sample of 60 females and an 
independent simple random sample of 40 males. Both asked all of the students in their samples how many soft 
drinks they typically consume in a week. 


(a) Describe a method Peter could have used to select a simple random sample of 100 students from the school. 
(eler Com use Qa aided seas num ber generate ae 
| 4 Kets ‘Peleg. wall jerente a pamddom  wumber 
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\\ Peter and Rania conducted their studies as described. Peter used the sample mean X asa point estimator for yu. 
Rania used Xai = (0.6)X female ~ (0.4) X male as a point estimator for u, where Tani is the mean of the 
sample of 60 females and Xmale is the mean of the sample of 40 males. 


Summary statistics for Peter’s data are shown in the table below. 


= 
Deviation 
Number of 
5.32 4.13 
| tras | | 82] ats 


(b) Based on the summary statistics, calculate the estimated standard deviation of the sampling distribution 
(sometimes called the standard error) of Peter’s point estimator X. 


4.13 
ig (Le 


\YVo 





Summary statistics for Rania’s data are shown in the table below. 


Deviation 
Numberof | Female | 60 | 290 | 180 | 
soft drinks 












(c) Based on the summary statistics, calculate the estimated standard deviation of the sampling distribution of 
Rania’s point estimator Xoveral! = (0.6)X female + (0.4)X male. 





Neco. ° 9.6( 9-232) +04 (0.951) 


LOE p [o-2 ro] 


————— Ee GO ON TO THE NEXT PAGE. 
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63 A dotplot of Peter’s sample data is given below. 
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Comparative dotplots of Rania’s sample data are given below. 
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(d) Using the dotplots above, explain why Rania’s point estimator has a smaller estimated standard deviation 
than the estimated standard deviation of Peter’s point estimator. 


Tyan Moe detel-ts | Genales lend be ere less 


Prgrc§ 
Ss oft Aewnlcs aa wee | Smee Rana wie A aH 


Some Wot was 6 O Ye Coocale: Oee 8 4 Sos male, 
ee S bancdas A Aevrahvu. slo wd be ¢ mallern 

f lendardA  chevtah en whe Weed wo’ 
wee poate does regardless 


S iwpl< rondom sam le £ [Go 


o£ Ga ter « 


wa A ais ddd. GO ON TO THE NEXT PAGE. 


-16- 


© 2012 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


6Cf = 
= STATISTICS OC} 
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Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II score—25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be scored on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. Two students at a large high school, Peter and Rania, wanted to estimate yz, the mean number of soft drinks that 


a student at their school consumes in a week. A complete roster of the names and genders for the 2,000 students 
at their school was available. Peter selected a simple random sample of 100 students. Rania, knowing that 

60 percent of the students at the school are female, selected a simple random sample of 60 females and an 
independent simple random sample of 40 males. Both asked all of the students in their sanples how many soft 
drinks they typically consume in a week. 


(a) Describe a method Peter could have used to select a simple random sample of 100 students from the school. 


dor wald howe CHa each vane om tle 
es Oeageeros Ma oo 
rae Hen hope Wary i pattie ye 
Gear ath . Mme) an Hue 
i 
Yur tad cho. pr tx homes ee toc, 

Ke 2000 a ae lute a hot mye 
up He hain wh tre ak gal Hea draw 
ot (00 name. Yha uadd Pnhuoy 

rand fale lv. 


} 
ro Wad di GO ON TO THE NEXT PAGE. 


-14- 


© 2012 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


Peter and Rania conducted their studies as described. Peter used the sample mean X asa point estimator for JL. 
Rania used Xoverall = (0.6)X female > (0.4) X male as a point estimator for 4, where Xtemale is the mean of the 


sample of 60 females and Xmale is the mean of the sample of 40 males. 


Summary statistics for Peter’s data are shown in the table below. 


=a 
Number of 
100 5.32 4.13 
| rane | 0 | saz] 


(b) Based on the summary statistics, calculate the estimated standard deviation of the sampling distribution 
eee eee 
(sometimes called the standard error) of Peter’s point estimator X. 


ys See (S- 
i - 2. » UW, 






Summary statistics for Rania’s data are shown in the table below. 


a 
Deviation 


Numberof | Female | 60. | 2.90 
soft drinks ee ee 







(c) Based on the summary statistics, calculate the estimated standard deviation of the sampling distribution of 
Rania’s point estimator Xoverall = (0.6)X female + (0.4)X male. 
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A dotplot of Peter’s sample data is given below. 
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Comparative dotplots of Rania’s sample data are given below. 
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(d) Using the dotplots above, explain why Rania’s point estimator has a smaller estimated standard deviation 
than the estimated standard deviation of Peter’s point estimator. 
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Question 6 
Overview 


The primary goals of this question were to assess students’ ability to (1) implement simple random 
sampling; (2) calculate an estimated standard deviation for a sample mean; (3) use properties of variances 
to determine the estimated standard deviation for an estimator; (4) explain why stratification reduces a 
standard error in a particular study. 


Sample: 6A 
Score: 4 


In part (a) the 2,000 students in the high school are assigned numbers from 0000 to 1,999. A random 
selection of 100 unique numbers from a random integers table is clearly described by ignoring repeated 
four-digit numbers and by ignoring the four-digit numbers greater than 1,999. The 100 unique random 
numbers are used to select the 100 students with the corresponding numbers. Thus, part (a) was scored as 
essentially correct. In part (b) the formula for the standard deviation of the sample mean is correctly 
identified by the appropriate statistics from the sample, and these statistics are used to calculate the 
correct standard error of Peter’s point estimator. Thus, part (b) was scored as essentially correct. The 
variances, weights, and sample sizes for female responses and male responses in Rania’s samples are 
correctly combined in part (c) to produce the correct estimated standard deviation of the stratified sample 
mean. Thus, part (c) was scored as essentially correct. In part (d) the standard deviations for each gender 
are identified as being smaller in Rania’s sample data than the standard deviation in Peter’s sample data. 
The two smaller variances for each gender are explicitly linked to the smaller sample standard deviation for 
Rania’s point estimator with the phrase “the rule for variances needed to calculate Rania’s.” Hence, 

part (d) was scored as essentially correct. With all four parts scored as essentially correct, the response 
earned a score of 4. 


Sample: 6B 
Score: 3 


In part (a) the response implicitly identifies each student with a number from 1 to 2,000. A random number 
generator is used to produce 100 unique numbers, and the students corresponding to the 100 unique 
numbers are selected for the simple random sample. Thus, part (a) was scored as essentially correct. In 
part (b) the formula for the standard deviation of the sample mean is correctly identified by use of the 
appropriate sample statistics, and the correct standard error of Peter’s point estimator is calculated. Thus, 
part (b) was scored as essentially correct. In part (c) the standard errors for the sample means for the 
female responses and male responses are correctly calculated, but the attempt to combine the standard 
deviations rather than the variances produces an incorrect estimate for the standard deviation of Rania’s 
point estimator. Thus, part (c) was scored as partially correct. The response in part (d) identifies the 
difference in sample means for the female and male responses, but the linkage of the reduced variability in 
the stratified sample to a smaller estimated standard deviation of the estimator for the mean is insufficient. 
Thus, part (d) was scored as partially correct. With two parts scored as essentially correct and two parts 
scored as partially correct, the response earned a score of 3. 
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Question 6 (continued) 


Sample: 6C 
Score: 2 


There are parallel solutions in the response for part (a). First, each name is randomly assigned a number 
from 1 to 2,000. Then 100 students are selected by choosing every 20th person on the randomized list. 
Because the list is randomized, any selection of 100 students results in a simple random sample. However, 
insufficient information is provided on the randomization of the list, so the solution was scored as partially 
correct. The second procedure of putting the names of the 2,000 students into a hat and randomly 
selecting 100 names from the hat is an essentially correct response. However, considering both responses, 
part (a) was scored as partially correct. In part (b) the formula for the standard deviation of the sample 
mean is correctly stated, and the appropriate sample statistics are used to calculate the correct standard 
error of Peter’s point estimator. Thus, part (b) was scored as essentially correct. In part (c) the standard 
errors for the sample means for the female responses and male responses are correctly calculated, but there 
is no attempt to combine the quantities to produce a single estimate for the standard deviation of Rania’s 
point estimator. Thus, part (c) was scored as incorrect. The response in part (d) identifies the difference in 
means for the female and male responses (heterogeneity between genders) but does not identify the 
variability for each gender being smaller in Rania’s sample data than the variability in Peter’s sample data. 
Moreover, the smaller sample sizes do not necessarily lead to smaller standard errors. Thus, part (d) was 
scored as partially correct. With one part scored as essentially correct, two parts scored as partially correct, 
and one part scored as incorrect, the response earned a score of 2. 
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